Sidekick agents for sequential planning problems

نویسنده

  • Owen Macindoe
چکیده

Effective Al sidekicks must solve the interlinked problems of understanding what their human collaborator's intentions are and planning actions to support them. This thesis explores a range of approximate but tractable approaches to planning for AI sidekicks based on decision-theoretic methods that reason about how the sidekick's actions will effect their beliefs about unobservable states of the world, including their collaborator's intentions. In doing so we extend an existing body of work on decisiontheoretic models of assistance to support information gathering and communication actions. We also apply Monte Carlo tree search methods for partially observable domains to the problem and introduce an ensemble-based parallelization strategy. These planning techniques are demonstrated across a range of video game domains. Thesis Supervisor: Leslie Pack Kaelbling Title: Professor Thesis Supervisor: Tomis Lozano-Perez Title: Professor

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sarah and Sally: Creating a Likeable and Competent AI Sidekick for a Videogame

Creating reasonable AI for sidekicks in games has proven to be a difficult challenge synthetizing player modelling and cooperative planning, both being problems hard by themselves. In this paper, we experiment with designing around these problems: we propose a cooperative puzzle-platformer game that was designed to look similarly to the mainstream of the genre, but to allow for an easy implemen...

متن کامل

Synaptic localization and function of Sidekick recognition molecules require MAGI scaffolding proteins.

Four transmembrane adhesion molecules-Sidekick-1, Sidekick-2, Down's syndrome cell adhesion molecule (Dscam), and Dscam-like-are determinants of lamina-specific synapse formation in the vertebrate retina. Their C termini are predicted to bind postsynaptic density (PSD)-95/Discs Large/ZO-1 (PDZ) domains, which are present in many synaptic scaffolding proteins. We identify members of the membrane...

متن کامل

PRUDENT: A Sequential-Decision-Making Framework for Solving Industrial Planning Problems

Planning and control are critical problems in industry. In this paper, we propose a planning framework called PRUDENT to address many common issues and challenges we are facing in industrial applications, including incompletely known world models, uncertainty, and very large problem spaces. This framework considers planning as sequential decision-making and applies integrated planning and learn...

متن کامل

Collective Multiagent Sequential Decision Making Under Uncertainty

Multiagent sequential decision making has seen rapid progress with formal models such as decentralized MDPs and POMDPs. However, scalability to large multiagent systems and applicability to real world problems remain limited. To address these challenges, we study multiagent planning problems where the collective behavior of a population of agents affects the joint-reward and environment dynamic...

متن کامل

On the Planning Problem in Sequential Control

Sequential control is a common control problem in industry. Despite its importance fairly little theoretical research has been devoted to this problem. We study a subclass of sequential control problems, which we call the SAS-PUBS class, and present a planning algorithm for this class. The algorithm is developed using formalism from artiicial intelligence (AI). For planning problems in the SAS-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013